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The induction of a signaling pathway is characterized by transient complex formation and mutual 
posttranslational modification of proteins. To faithfully capture this combinatorial process in a math- 
ematical model is an important challenge in systems biology. Exploiting the Umited context on which 
most binding and modification events are conditioned, attempts have been made to reduce the com- 
binatorial complexity by quotienting the reachable set of molecular species, into species aggregates 
while preserving the deterministic semantics of the thermodynamic limit. Recently we proposed a 
quotienting that also preserves the stochastic semantics and that is complete in the sense that the 
semantics of individual species can be recovered from the aggregate semantics. In this paper we 
prove that this quotienting yields a sufficient condition for weak lumpability and that it gives rise to a 
backward Markov bisimulation between the original and aggregated transition system. We illustrate 
the framework on a case study of the EGF/insulin receptor crosstalk. 

1 Introduction 

Often a few elementary events of binding and covalent modification fST] in a biomolecular reaction 
system give rise to a combinatorial number of non-isomorphic reachable species or complexes jT/lfTSl . 
Instances of such systems are signaling pathway, polymerizations involved in cytoskeleton maintainance, 
the formation of transcription factor complexes in gene-regulation. 

For such biomolecular systems, traditional chemical kinetics face fundamental limitations, that are 
related to the question of how biomolecular events are represented and translated into a mathematical 
model [24] . More specifically, chemical reactions can only operate on a collection of fully specified 
molecular species and each such species gives rise to one differential equation, describing the rate of 
change of that species' concentration. Many combinatorial systems do not permit the enumeration of all 
molecular species and thus render their traditional differential description prohibitive. However, even if 
one could enumerate them, it remains questionable whether chemical reactions are the appropriate way 
to represent and to reason about such systems. 

As the dynamics of a biomolecular reaction mixture comes about through the repeated execution of a 
few elementary events one may wonder about the effective degrees of freedom of the reaction mixture's 
dynamics. If the velocity of all events - or their probabilities to occur per time-unit per instance - 
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are different for all complexes (w.r.t. modification) and pairs of complexes (w.r.t. binding) to which 
the events can apply to, then the degrees of freedom would equal to the number of molecular species. 
However, due to the local nature of physical forces underlying molecular dynamics, the kinetics of most 
events appear to be ignorant with respect to the global configuration of the complexes they are operating 
on. More provocatively, one may say that even if there would be variations of kinetics of an event from 
one context to another, experimental biology does not - and most likely never will - have the means to 
discern between all different contexts. For instance, fluorescence resonance energy transfer (FRET), may 
report on a specific protein-binding event and even its velocity, however we have no means to determine 
whether the binding partners are already part of a protein complex - not to speak of the composition and 
modification state of these complexes. To this end, molecular species remain elusive and appear to be 
inappropriate entities of descriptions. 

To align with the mentioned experimental insufficiencies and with the underlying biophysical lo- 
cality, rule-based or agent-based descriptions were introduced as a framework to encode such reaction 
mixtures succinctly and to enable their mathematical analysis ll8l|3l. Rules exploit the limited context 
on which most elementary events are conditioned. They just enumerate that part of a molecular species 
that is relevant for a rule to be applicable. Thus, in contrast to chemical reactions, rules can operate on a 
collection of partially specified molecular species. Recently, attempts have been made to identify the set 
of those partially specified species that allow for a self-consistent description of the rule-set's dynamics 
IITII6II. Naturally, as partially specified species - or fragments - in general encompass many fully speci- 
fied species, the cardinality of that set is less than of the set of molecular species. These approaches aim 
to obtain a self-consistent fragment dynamics based on ordinary differential equations. It represents the 
dynamics in the thermodynamic limit of stochastic kinetics when scaling species multiplicities to infinity 
while maintaining a constant concentration (multiplicity per unit volume) [221 . In many applications in 
cellular biology this limiting dynamics is an inappropriate model due to the low multiplicities of some 
molecular species - think of transcription factor - DNA binding events. We recently showed that the 
obtained differential fragments cannot be used to describe the finite volume case of stochastic kinetics 
lfT2l . Exploiting statistical independence of events we instead derived stochastic fragments that repre- 
sent the effective degrees of freedom in the stochastic case. Conceptually, the procedure replaces the 
Cartesian product by a Cartesian sum for statistically independent states. In contrast to the differential 
case, stochastic fragments have the important property that the sample paths of molecular species can be 
recovered from that of partially specified species. 

We believe that interdisciplinary fields, such as systems biology, can move forward quickly enough 
only if the well-established knowledge in each of the disciplines involved is exploited to its maximum, 
i.e. if the standardized, well-established theories are recognized and re-used. For that reason, in this 
paper we translate our abstraction method (|12|) into the language of well-established contexts of ab- 
straction for probabilistic systems - lumpability and bisimulation. Lumpability is mostly considered 
from a theoretical point of view in the theory of stochastic processes |[T9llT3ll28ll26l 1271 141. A Markov 
chain is lumpable with respect to a given aggregation (quotienting) of its states, if the lumped chain pre- 
serves the Markov property |21 1. A sound aggregation for any initial probability distribution is referred 
to as strong lumpability, while otherwise it is termed weak lumpability [4^^ 2^. Approximate aggregation 
techniques for Markov chains of biochemical networks are discussed in llT4ll . Probabilistic bisimulation 
was introduced as an extension to classic bisimulation in ll23l . It is extended to continuous-state and 
continuous-time in |9| and, for the discrete-state case, to weak bisimulation [2J. For instance, in ||9l 
the authors use bisimulation of labelled Markov processes, the state space of which is not necessarily 
discrete, and they provide a logical characterization of probabilistic bisimulation. Another notion of 
weak bisimulation was recently introduced in lITOll . Therein two labeled Markov chains are defined to 
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be equivalent if every finite sequence of observations has the same probability of occurring in the two 
chains. Herein we recognize the sound aggregations of (12] as a form of backward Markov bisimulations 
on weighted labeled transition systems (WLTS), and we show it to be equivalent to the notion of weak 
lumpability on Markov chains. 

The remaining part of the paper is organized as follows. In the Section [2j we introduce weighted 
labeled transition systems (WLTS) and we assign it the trace density semantics of a continuous-time 
Markov chain (CTMC). Moreover, we define the Kappa language, and we assign a WLTS to a Kappa 
specification. Based on the notion of the annotated contact map we briefly summarize in Section [3] the 
general procedure to compute stochastic fragments, as it is offered in llT2l . In Section [4j we introduce 
the characterizations of sound and complete abstractions on WLTS as a backward Markov bisimulation. 
Moreover, we define it being equivalent to the weak lumpability on Markov chains. Finally, we provide 
in Section [5] results for the achieved dimensionality reduction for a rule-based model of the crosstalk 
between the EGF/insulin signaling pathway 0. This mechanistic model comprises 76 rules giving rise 
to 42956 reactions and 2768 molecular species. 

2 Preliminaries 

The stochastic semantics of a biochemical network is modelled by a continuous-time Markov chain 
(CTMC). The main object that we will use in the analysis is the weighted labelled transition system 
(WLTS) on a countable state space. We will assign a WLTS to a given Kappa specification, and we 
manipulate that object when reasoning about abstractions. 

2.1 CTMC 

We will observe the CTMC that is generated by the weighted labelled transition system (WLTS) on a 
countable state space. We define the CTMC of a WLTS, by defining the Borel a-algebra containing all 
cylinder sets of ti^aces [20] that can occur in the system, and the corresponding probability distribution 
among them. We also introduce the standard notation of a rate matrix, which we will use when analysing 
the lumpability and bisimulation properties in Sec.|4] 

Definition 1. (WLTS) A weighted-labelled transition system W is a tuple ,^,w, TTq), where 

• ^ is a countable state space; 

• ^ is a set of labels; 

• w.^x^xl^^ Mq is the weighting function that maps two states and a label to a real value; 

• TTo : =^ — >■ [0, 1] is an initial probability distribution. 

We assume that the label fully identifies the transition, i.e. for any x G ^ and I € there is at most one 
x' S J2r, such that w{x,l,x') > 0. Moreover, we assume that the system is finitely branching, in the sense 
that (i) the set {x €z ^ \ 7lo{x) > 0} is finite, and (ii)for arbitrary x G the set {{l,x') £ ^ x ^ \ 
w{x,l,x') > 0} is finite. 

The activity of the state x„ denoted a : J^T — t- Mq is the sum of all weights originating at xu i.e. 

a{xi) := ^{w{xi,l,Xj) \ xj G I G 

The definition of a WLTS implicitely defines a transition relation — s-C ^ x such that {xi,Xj) G— 
if and only if there exists a non-zero transition from state Xj to state xj, i.e. the total weight over all labels 
is strictly bigger then zero, written Y.{w{xi,l,Xj) \ I G =Sf } > 0. Moreover, we can differentiate the initial 
set of states J' C 3^ , such that their initial probabilities are positive, i.e. .J' = {x£ !X \ 7ro(x) > 0}. 
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Definition 2. (Rate matrix of a WLTS) Given a WLTS W = ,^,w, tTq), we assign it the CTMC rate 
matrix /? : <^ x ^ — > R, given by R{xi,Xj) = T^{w{xi,l,Xj) \ I G 

The consequence is that we do not enforce /?(x,,x,) = — I ' / 7}> it is usual for the 

generator matrix of CTMCs. This however does not affect the transient, not the steady-state behavior 
of the CTMC [1|. We do so for the following reason. When abstracting the WLTS by partitioning the 
state space, we get another WLTS. If the two states x and x' which have a transition between each other 
were aggregated in the same partition class x, it will result as a prolongation of the residence time in the 
abstract state x, i.e. we will have a self-loop in the abstract WLTS. 

If we refer to the generated stochastic Markov process, written as a continuous-time random variable 
over the countable state space We write Pr(X, = x,), the probability that the process takes 
the value x,- at time point t. It thus holds that Pr{Xo = x,) = 7io{xi), and Pr(Xf+d? = \ Xt = x,) = 
/?(x;,xy)df when / 7^ 7, whereas Pr(X,+df = -^i I X; = x,) = /?(x,,x;)df + (1 — ^{/?(x;,xy/)df | Xf £ J^}), 
which gives after simplification: Pr{Xt+dt = \ Xt = x,) = 1 — X^{/?(x,,x/)df | / 7^ /}. 

We define the cylinder sets of traces that can be observed in the system W. By observing the trace at 
a certain time point, we mean observing the sequence of visited states, labels that were assigned to the 
executed transitions, and time points of when the transition happened. 

Definition 3. (A trace of a WLTS) Let us observe the WLTS W = ,^,w, tTq) <^nd its CTMC. Given a 
number k in N, we define a trace of length k as T £ x ^ X R+f X written 

T=Xo Xi...Xi._i Xk. 

If the trace T is such that (i) 7lo{xo) > 0, and (ii)for all i, < i <k, we have that w(x,,/,-,x,+i) > 0, then 
we say that T belongs to the set of traces ofW, and we write it T £ S^{W\ 

The 'time stamps' on each of the transitions denote intuitively the absolute time of the transition, 
from the moment when the system was started it = 0). We cannot assign the probability distribution to 
the traces in since the probability of any such trace is zero. We thus introduce the cylinder set of 

traces over intervals of times. 

Definition 4. ( Cylinder set of traces) If IE. is the set of all nonempty intervals in Mq , then we define the 
cylinder set of traces Tug e (.^ x ^ x IM)*^ x such that: 

TiK = Xo H' Xl . . . X/t_ 1 Xk (1) 

denotes the set of all traces T = xq x\ . . .Xk-\ x^, such that tj S /,-, 1 < / < A:. If the cylinder 

of traces Tm. such that 71o{xq) > 0, and for all i = 0,...,k — 1, we have that w{xi,li,Xi-\-i) > 0, then we 
say that Tjir belongs to the cylinder set of traces ofW, and we write Tm G =^r(^)- 

Let Q.{^S/m.(W)) be the smallest Borel a-algebra that contains all the cylinder sets of traces in 
^mJy^)- We define a probability measure over in the following way. 

Definition 5. (Trace density semantics on a WLTS) Given a WLTS {^,.Sf, w, TTq), and a number k in N, 
the probability of the cylinder set of traces Tm £ ^r{'^)> specified as in expression ([T|, is given by: 

= n{xo H XI . . .x,_i H Xk) = 7ro(xo) fl _ ,-(..-O sup(/,)\ . 

f^j a{xi-i) V y 

Note that /^.a(x,_i)e""(^'-i) 'd? = ^-"{j^'-O infW) _ g-a(jf;-i) sup(/,) t^e probability of exiting the state 
x,-i in a time interval since the probability density function of the residence time of x;_i is equal to 
a(x;_i)e-^(-'^'-i). 
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2.2 Kappa 

We present Kappa in a process-like notation. We start with an operational semantics, then define the 
stochastic semantics of a Kappa model. 

We assume a finite set of agent names jaf, representing different kinds of proteins; a finite set of 
sites =5^, corresponding to protein domains; a finite set of internal states I, and I^iXx two signature maps 
from to p{y), listing the domains of a protein which can bears respectively an internal state and a 
binding state. We denote by E the signature map that associates to each agent name Ae^ the combined 
interface £i(A) U£;l(A). 

Definition 6. (Kappa agent) A Kappa agent A(a) is defined by its type A G ^ and its interface o. In 
A{c), the interface (J is a sequence of sites s in E(A), with internal states (as subscript) and binding 
states (as superscript). The internal state of the site s may be written as Sg, which means that either it 
does not have internal states (when s G S(A) \ Z; (A) ), or it is not specified. A site that bears an internal 
state m Glis written s^ ( in such a case 5 G Si (A) ). The binding states of a site s can be specified as s^, 
if it is free, otherwise it is bound (which is possible only when s G ^.^{A)). There are several levels of 
information about the binding partner: we use a binding label i G N when we know the binding partner, 
or a wildcard bond — when we only know that the site is bound. The detailed description of the syntax 
of a Kappa agent is given by the following grammar: 



a 
N 
c 



:= N{a) (agent) s 
:= A G J2/ (agent name) n 
:= e I s,c (interface) I 

X 



nf (site) 

xG 5^ ( site name ) 

e I mGl (internal state) 

e I — I j G N (binding state) 



We generally omit the symbol e. 

Definition 7. (Kappa expression) Kappa expression E is a set of agents A(c7) and fictitious agents 0. 
Thus the syntax of a Kappa expression is defined as follows: 

E ::= £ I a , E \ id , E. 

The structural equivalence =, defined as the smallest binary equivalence relation between expressions 
that satisfies the rules given as follows: 

, „ , , , „ , j, /■ G N and / does not occur in E 
E,A{g,s,s',g'),E' = E ,A{g,s',s,g'),E' ^ E\ili\^E 

E ,a,a' ,E' = E ,a' ,a,E' ■ ^ ■ , 

^ ^ „ ? G N and i occurs only once m E 

E = E , <d — -j 

E[e/i] = E 

stipulates that neither the order of sites in interfaces nor the order of agents in expressions matters, that 
a fictitious agent might as well not be there, that binding labels can be injectively renamed and that 

dangling bonds can be removed. 

Definition 8. (Kappa pattern. Kappa mixture) A Kappa pattern is a Kappa expression which satisfies 
the following five conditions: (i) no site name occurs more than once in a given interface; (ii) each site 
name s in the interface of the agent A occurs in E(A); (Hi) each site s which occurs in the interface of 
the agent A with a non empty internal state occurs in Z( (A); ( iv ) each site s which occurs in the interface 
of the agent A with a non empty binding state occurs in 11;^, (A); and (v) each binding label i G N occurs 
exactly twice if it does at all -there are no dangling bonds. A mixture is a pattern that is fully specified, 
i.e. each agent A documents its full interface 11(A), a site can only be free or tagged with a binding label 
? G N, a site in Ej (A) bears an internal state in I, and no fictitious agent occurs. 
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Definition 9. (Kappa rule) A Kappa rule r is defined by two Kappa patterns E( and Er, and a rate 
k G Mq , and is written: r = E( ^ Er@k. 

A rule r is well-defined, if the expression Er is obtained from E( by finite application of the following 
operations: (i) creation (some fictitious agents are replaced with some fully defined agents of the form 
A(a), moreover o documents all the sites occurring in r(A) and all site in £( (A) bears an internal state 
in 1), (ii) unbinding (some occurrences of the wild card and binding labels are removed), (Hi) deletion 
(some agents with only free sites are replaced with fictitious agent 0j, ( iv) modification (some non-empty 
internal states are replaced with some non-empty internal states), (v) binding (some free sites are bound 
pair-wise by using binding labels in Nj. 

From now on, we assume all rules to be well-defined. We sometimes omit the rate of a rule. More- 
over, we denote hy E^ -(r^ E,-@k\, k2 the two rules Ef ^ E,. ^ @k\ and Er ^ E(@k2. 

Definition 10. (Kappa system) A Kappa system !M = (K'lf ,{ri, . . . ,r„}) is given by finite distribution 
over initial mixtures %^ : {Mq, , . . . ,jWoj,} — [0, 1], and a finite set of rules {ri , . . . , r„}. 

In order to apply a rule r :=Ei^ Er@k\.o ^ mixture M, we use the structural equivalence = to bring 
the participating agents to the front of E (with their sites in the same order as in E(), rename binding 
labels if necessary and introduce a fictitious agent for each agent that is created by r. This yields an 
equivalent expression E' that matches the left and side (Ihs) Ff, which is written F ^ F^ as defined as 
follows: 



F 

■ aii f\E \= Ei 
01 



E ^ae , Eg 



s\= s^Acj 



=> N{a) 
a 1= £ 



S,0 \=S(;,Oi 



1= If aA \=?ii 
li G {e,i} 



I 1= le 



Note that in order to find a matching, we only uses structural equivalence on F, not Ff. We then 
replace E' by E'[Er] which is defined as follows: 

E[e]=E N{a)[N{ar)]=N{a[ar]) =„f[Jl 
{a , F)[<3,- , Er] = a[ar] , F[F,-] a[e] = a G I =^ i[ir] = ir 

®[ar]=ar {s,a)[sr,Or] = s[sr],a[ar] G NU{£} =^> = 
a,-[0]=0 X[-] = X 

This may produce dangling bonds (if r unbinds a wildcard bond or destroys an agent on one side of a 
bond) or fictitious agents (if r destroys agents), so we use = resolve them. 



2.2.1 Population-based stochastic semantics 

In addition to the rate constants k, careful counting of the number of times each rule can be applied to 
a mixture is required to define the system's quantitative semantics correctly ||7]|25l. Thus we define the 
notions of embedding between a mixture and an expression. Let Z = ai , ... , a^ and = ci , . . . be 
two patterns with no occurrence of the fictitious agent and such that there exists a pattern Z' = bi, . . . ,bm 
that satisfies both Z = Z! and Z! \= Zf (and so, in particular, n < m). 

The agent permutations used in the proof that Z = Z! allow us to derive a permutation p such that 
ap(,) = bi. The restriction of p to the integers between 1 and n is called an embedding between Zi and 
Z. This is written Z^ Z. There may be several embeddings between Z^ and Z for the same Z'; if so, 
this influences the relative weight of the reaction in the stochastic semantics. We denote by [Z,Z'] the set 
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of embeddings between Z and Z'. This notion of embedding is extended to patterns (including fictitious 
agents) by defining Z/ <|0 Z if, and only if, (4,0 Z^) <^ (J,© Z), where Iq removes all occurrences of the 
fictitious agent in patterns. 

We assume that E( is the Ihs of a rule r:= Eg ^ Er@k and Z is a mixture such that Eg Z. Let 
Z = ai,. . . ,am and Eg = ci,. . . Cn- Given Z' = Z (we write I0 Z' = Z^i , . . . , b,n) and a bijection p such 
that we have Z' |= E^, bj = tjpj,) for 1 < / < m and ^{j) = for 1 < j < n. The result of applying r 
along (j) to the mixture Z is defined (modulo =) as any pattern that is =-equivalent to Z'[Er]. In other 
words the embedding <j> between Eg and Z fully defines the action of r on Z up to structural equivalence. 

We are now ready to define the stochastic semantics by the mean of a WLTS. In this semantics, the 
state is a soup of agents, that is to say that we do not care about the order of agents in mixture. So the 
states of the system are the class of =-mixture. 

Defining species as connected mixture, the state of the system can be seen as a multi-set of species. 
The formal definition of a Kappa species is as follows: 

Definition 11. (Kappa species) A pattern E is reducible whenever E = E',E" for some non-empty pat- 
terns E', E"; A Kappa species is the —equivalence class of a irreducible Kappa mixture. 

As explained earlier, the action of a rule r on a mixture E is fully defined (up to =) by an embedding 
between the Ihs Eg of the rule r and the mixture. So as to consider computation steps over =-equivalent 
of mixtures, we introduce an equivalence relation =_sf over triples (r,^, 0) where is an embedding of 
the Ihs Eg of r into E. We say that (n, £1,(^1) = ^ (r2,£'2,<fc) if, and only if, (i) ri = r2 and (ii) there 
exists an embedding l/A G [£1 , £2] such that <j>2 = y/oipi. 

Definition 12. (WLTS of a Kappa system) Let ^ = (TTff , {ri , . . . , r,,}) be a Kappa system. We define the 
WLTS 'Wp^ = ( J^T, w, TTo) where: ( i) ^ is the set of all —equivalent classes of mixtures; ( ii) ^ is the 
set of all ~(£ -equivalence classes of triples (^r,E,^) such that is an embedding between the Ihs Eg ofr 

k 

andE; (Hi) w{x,l,x') = whenever there exist a ruler = Eg -^Er@k, two mixtures E and E' , and an 

\[Et,Et]\ 

embedding G [Eg,E], such thatx = [E]=, I = [r,^, 0] = j», andE' is the result (up to =)ofthe application 
ofr along (p to the mixture E; otherwise wix,l^x') = 0; (iv) Tlo{x) = Y^{ti'^{E') \ E' G Dom{Tl'^')r\x}. 

The stochastic semantics of a Kappa system ^ is then defined as the trace distribution semantics of 
the WLTS W^. 

3 Reduction procedure 

In this section, we assume, without any loss of generality that £[ and "Lx are disjoint sets. This can always 
be achieved by taking two disjoint copies yi and of ^ and using site names in to bear internal 
states, and site names in S^x to bear binding states. 

Informally, a contact map represents a summary of the agents and their potential bindings. 

Definition 13. ( Contact map) Given a Kappa system a contact map ( CM) is a graph object {-yV, S'), 
where the set of nodes ./K are agent types equipped with the corresponding interface, and the edges 
are specified between the sites of the nodes. Formally, we have that JV = {(A,r(A)) | A G is/} and 
S C {((A, 5), (A',/)) I A,A' G st^ and s G 2:(A),/ G 2:(A')}. If the site s of an agent of type A and the 
site s' of an agent of type A' bear the same binding state in the rhs Er of a rule, then there exists an edge 
e = {{A,s),{A',s')) G S' between s G Z(A) and s' G E(A'). 

We say that a site s of the agent a is tested by the rule r, if it is contained in the Ihs Eg of the rule r. 
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Definition 14. (Annotated Contact map) Given a Kappa system and its CM {J\^ a valid an- 
notated contact map (ACM) is a contact map where all agents are annotated with respect to 
the rule set The annotation on the agent of type A ^ with respect to the rule r is given by the 
equivalence relation on its set of sites ^{A) x E(A) such that: 

• If a rule r tests the sites si and site S2 of agents ai,a2 (it is possible that ai = a2) of type A, then 

• If a rule r creates an agent a of type A, then all the sites of'L{A) are in the same equivalence class, 
i.e. K,A= X r(A); 

Note that there can be several annotations of the agent type A G ^ which satisfy the conditions. More 
precisely, if the equivalence relation «a meets the condition, then so does any of its refinement. This 
allows to define the smallest such equivalence relation ~^ which we call the minimal annotation of 
agent A. An ACM is minimal whenever each agent type is annotated by its minimal annotation. 

Let r be a rule and an ACM which is valid with respect to the singleton {r}. Then for any agent 
type A G either A does not occur in the Ihs of r, or A occurs but all occurrences of A have an empty 
interface, or A occurs, tests some sites which all belongs to a same equivalence class C in r^a- In the 
latter case, we define test^^*^(A) = C, otherwise, we define test^*'*'(A) = 0. 

The meaning of the ACM is to summarize the dependences between sites that can occur during the 
simulation of a Kappa system. If the two sites s and / in the E(A) are correlated by the relation r^a, 
i.e. s ^ , it suggests that they are dependent in the following way. We must not aggregate in the same 
equivalence class any two states x and x' , such that they contain the agent A in a different evaluation 
of the sites s and s' . On the other hand, if the two sites s and s' are not correlated by k,a, then we 
may aggregate the states by the 'marginal' criteria, i.e. the condition which involves only one of the 
sites. Therefore, the less states are related by (~a)agj^' the better the reduction will be. To numerically 
justify this, we can imagine having an agent type A whose interface has n different sites s\,...,Sn, and 
each of them has two possible internal state modifications. Let us observe the two limiting relations k,a, 
i.e. ~A= {{siiSj) I 1 < j < n, 1 < y < n}, and ~a~ {{^i-i^d I 1 < ? < «}• The annotation r^a enforces at 
least to 2" states to describe all modifications of the agent A, whereas the annotation Ri^ suggests that it 
is enough to use only 2 • « of them. 

The ACM can be used to identify parts of Kappa species that we call fragments. 
Definition 15. (Kappa fragments) A fragment is the —equivalent class of a non empty irreducible pat- 
tern E such that: (i) the set of sites in the interface a of an agent A{a) in E is an equivalence class of 
~A> ( ii) sites can only be free or tagged with a binding label i G N and sites in Ej are tagged with an 
internal state in I, ( Hi) there is no occurrence of fictitious agent 0. 

We can use fragments to abstract the WLTS by identifying the mixtures which have the same 
(multi-)set of fragments. To reach that goal, we first overload the definition of = in order to identify 
mixtures having the same fragments. We introduce the binary relation =" as the smallest equivalence 
relation over patterns which is compatible with = and such that: 

A{a),A{c'),E =« A(tc <T',tE(A)\c <T),A(tc <T,tE(A)\c (y'),E 

for any agent type A G 0,0' interfaces, E pattern, and C an r^a -equivalence class of sites. For any 
set of sites X C the projection function fx over interfaces keeps only the sites in X, formally fx is 
defined by tx e = £, tx {si,o') = 5^, tx o' whenever seX, and tx {s^,o') =tx o' otherwise. 

Now we define the relation ~ which stipulates that the rule r\ applies on E\ along (^1 the same 
way as the rule r2 on E2 along More formally, we write (ri, £1,(^1) ~ (r2,£'2,<fc) whenever the 
following properties are all satisfied: 
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1- n = r2; 

2. El =^E2\ 

3. 02 = W° 01' where i/a is the permutation which tracks how the sub-interface ttest'*™^^ ^ (A;(a,)) is 
moved in the proof that E\ E2, for any agent A;(a,) occurring in Ei . 

More precisely, the transposition [|.^ J is associated to an agent permutation of the agents at position 
/ and / + 1; the transposition [2] is associated to a step which permutes the sub-interface test^*'*^(A) 
of two agents of type A, for any agent type A G ^2/ ; any other step is associated with the identity 
function (over N). The function i/a is defined as the composition of all the permutations (in the 
reverse order) which are associated to the elementary steps in the proof that Ei £'2. 

4. the result of the application of r\ to E\ along 0i is =-equivalent to the result of the application of 
r2 to E2 along 02- 

Definition 16. (Abstract WLTS of a Kappa system) Let = (tt^, {ri , . . . , r„}) be a Kappa system. We 
define the WLTS #^ = (jr,^,w,7fo) where: 

• ^ is the set of all =^ -equivalent class of mixture; 

• ^ is the set of all =y-equivalent class of triples {r,E,(j)) such that is an embedding between 
the Ihs Ei ofr and E; 

~ ~ k 

• A , jc') is equal to whenever there exist a mixture E, a rule r, and an embeddings such 

\[Et,E(]\ 

thatx= and X = [r,£',0]^j^ ; otherwise, w{x,X,x') is equal to Q; 

• for anyx£,£, ;f()(x) = LE'eDom{7i^^)nx ^oiE')- 

Let us define the relation ~ over ^ by [£"1]= ~ [£"2]= if. and only if, Ei E2 and the relation 
over ^ by [Ai]=^ ~ \Xt\=,^ if, and only if, X\ =^ A2. 

4 Abstraction 

We introduce abstractions on WLTS by aggregating the states and labels into partition classes. We 
obtain a new WLTS defined over the aggregated states and labels. Each non-trivial abstraction is a loss 
of information. However some of them are such that it is possible to do the stochastic analysis on the 
aggregates rather than on concrete states. We address the problem of characterizing when this is possible, 
and if so, how the weights in the abstracted system are computed. We also discuss the reverse process 
- given the abstracted system, and a particular probability distributions over the aggregates, whether we 
can make conclusions about the traces in the concrete system. We do the general theoretical analysis of 
the abstractions on WLTS, and afterwards we show the relation with the reduction of Kappa systems, 
that is presented in Sec. [3] 

Definition 17. (Abstraction) Consider a WLTS W = (J^T,^, w, TTq), and a pair of equivalence relations 
(~,~ jc) S X such that each ^-equivalence class and each ~ ^-equivalence class is finite. We 
denote the equivalence classes by x, I, and we write x ^ x, to indicate that x belongs to the equivalence 
class X, and I £ I to indicate that the label I belongs to the equivalence class I. 

A WLTS of the form W = (c^/^,^/^^,!^, ;fo), where Tt^^x) = ^{TroW | x G x} is called an abstrac- 
tion ofW, induced by the pair of equivalence relations (~,~^). Note that more abstractions can be 
induced by 'W , depending on how w is defined. 

Moreover, for any two cylinder sets of traces fiu G and %m. £ •^m(>^)> we say that Tir = 

xq 'H' x\ . . .Xk-i 'H* Xk is an abstraction o/tjk = xq xi . . .x^-i x^, and we write it Tir G Tm. 
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Definition 18. ( Sound abstraction: Aggregation ) We say that the abstraction W is a sound abstraction 
ofW, if the probability of any cylinder set of traces Tiir G =^^r(/^) is equal to the sum of the probabilities 
of all the cylinder sets of traces Tj^ G ^wJC^X whose abstraction is Tjr; 

^('^im) = Y^l^i^m) I Tim G ^r}- 



We introduce a function y : — )• ( =i?r — )• [0, 1] ) which assigns to each of the partition class x G 
a probability distribution over the states that belong to this partition class. The set of all such vectors 7 
we denote by F^-^ is defined as: 

{7I 7: JT/^ ^ (=r ^ [0,l])AV;cG Jr,£7(x,x) = 1}. 

We can think of the value y{x,x) as the conditional probability of being in the state x, knowing that we 
are in state x, i.e. Pr(X, =x \ Xf Gx) = y{x,x). We note that, when thinking of 7 as the conditional prob- 
ability, it should be a time-dependent value. However, we refer to 7 as to a single, constant distribution. 
This will be justified in Lem. [T] 

Definition 19. (Complete abstraction: Deaggregation) We say that the abstraction W is a complete 
abstraction of 'W for 7G F,^ if the following holds. Given the probability of an arbitrary abstract 
cylinder set of traces of length ^ > 1, that ends in the abstract state xi (written fjig — t- x^^), we can 
recompute the probability of ending the trace in the concrete state Xk G Xk in the following way: 

7r(fM Xk) = y{xk,xk) ■ n{Tm xk). 

Sound and complete abstractions W cannot be induced by any pair of relations (~,~ j^), because 
there might not exist a weighting function w : x ^/^^ x — )• M, such that the conditions from 



Dfn. 18 and Dfn. 19 are met. Moreover, even if such w exists, the remaining question is whether the 
information on the abstract system is enough to compute them. 

We now restate the main Theorem from |[T2l . that the abstractions for Kappa systems, that we re- 
sumed in Sec.[3j are sound and complete. 

Theorem 1. (The abstraction induced by the ACM is sound and complete) Given a Kappa system M = 
{tI'q , {n, •••,''«}), and an ACM (c/K, S'), an abstraction = w, TTq) induced by the partition 



classes C x ^ , as proposed in the Def. 16 is a sound and complete abstraction of the 



Wi^ = {S^ ,w,TIq), provided that for any two mixtures M and M' such that M =^ M', we have: 

TtoiiM]^) • |[M',M']| = noi[M%) • |[M,M]|. 

We consider a mixture M. We denote by x €z ^ the equivalence class [M]=, and by x £ the equiva- 
lence class [M]=j = [x]^. The conditional probability 7(x,x) is computed as the ratio of the number of 
automorphisms of x (embedding between x and x) and the sum of the number of automorphisms of any 
^-equivalent state. Thus we have: 

,^ X \\x,x\\ 



The reader can find the detailed proof in |[T2l . 
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4.1 Lumpability 

Now we define different versions of lumpability and investigate the relationship with sound and complete 
abstractions. 

Definition 20. (Lumped process) Given a WLTS W = (^,^,w, tTq), where = {xi,X2, ■■■}, and a 
partition ~C ^ x ^ on its state space, we observe the continuous-time stochastic process {^f},g]R+, 
that is generated by W (Dfn. |2|. We define the lumped process {7^} on the state space S^j^^ = {xi,X2,---} 
(denoted by capital indices, i.e. xj, xj) and with initial distribution Kq, so that 

Pr{Y, =xj I ^0 = ^o) = ^r{Xt G | Xq G fo)- 

The lumped process is not necessarily a Markov process. 

Definition 21. (Lumpability) Given a WLTS W = (,^,^,w, ttq) that generates the process {Xf}, we say 
that it is lumpable with respect to the equivalence relation ~C ^ x ^ if and only if its lumped process 
{7;} has the Markov property. 

The evolution of a process depends on the initial distribution, and so does the lumpability property. 
We thus define the set of initial distributions, for which the lumpability holds. We denote the set of all 
probability distributions over as : 

^g^ = {K\ 7r:^^[/,oo]and £ 7r(§))=-}. 

Moreover, we denote the set of initial distributions that produce a chain lumpable with respect to the 
given equivalence relation ~ by 3^-^ ^ : 

r^:= {k I the lumped process initialized with n is lumpable with respect to ~}. 

Whenever a distribution n G .S^^f; is positive on the equivalence class x, i.e. L{7r(x) | x G x} > 0, 
we denote by 7r|,v(x), the conditional distribution over the states of x: n\x{x) = 7i{x)/n{x), when x G x, 
and 7l\x{x) = 0, otherwise. 

Definition 22. (Strong and weak lumpability) Given a WLTS "W = ,^,w,7l(i) that generates the 
process {Xt}, and an equivalence relation ~C ^ x we say that {X,} is: 

• strongly lumpable with respect to ~, if the lumped process {7^} is Markov with respect to any 
initial distribution, i.e. ^ = 

• weakly lumpable with respect to ~, if there exists an initial distribution that makes the lumped 
process {F,} Markov, i.e. S^"^ ^ 7^ 0- 

Note that the definitions of strong and weak lumpability involve the quantifiers "for all" and "exists" 
over the probability distributions over a set of states. Thus, checking for either of them involves in general 
an infinite number of checks. People have given sufficient conditions of strong and weak lumpability on 
discrete-time Markov chains (DTMC's) |[2ni26i] . The results had been extended to the continuous-time 
case ||4l|27l. We rephrase the sufficient conditions stated therein. 

In order to understand the sense of the weak lumpability characterization, we discuss the meaning 

of 7. We recall the semantics of a WLTS 'W by observing the cylinder sets of traces, i.e. Tjir = xq 
x\ . . .Xk-i Xk G =^r(>^). The abstraction W of W, induced by (~, generates an abstract cylinder 
set of traces, denoted Tm = xq 'H^ xi . . .x^-i x^ G ^r{W). 
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For any cylinder set of traces G ^r{'^), we denote by y^^^ the distribution of the conditional 
probabilities over the lumped state x/^, knowing that the abstract cylinder of traces Thr, which ends in the 
abstract state x^, was observed, i.e. 



K 

The definition of the complete abstraction suggests that, if y^j^ was independent of the traces on which it 
is conditioned, i.e. Xm, then the completeness would hold. 

Theorem 2. (Lumpability on CTMCs) Let us observe a WLTS W = ,^,w,Ko) that generates the 
process {Xt}, and an equivalence relation ~C ^ x We consider the rate matrix R : x ,^2^ — >• M. 
If the lumped process is Markov, then we denote its rate matrix by R : x — J- M. Then we have 
the following characterizations about the lumped process 

• If for all x,| ,Xj2 € ^ such that x,-, ^ xi^, and for all xj G ^j^, we have that 

R{xi,,Xj)= R{xi^,Xj), (2) 
Xji^xj Xjexj 

then {Xf} is strongly lumpable with respect to ~; We have: 

=^{/?(x;(,X^) I Xj£Xj}; 

• If there exists a family of probability distributions over the lumped states, y G ^,%\r^, such that for 
allxj^^Xj^ G ^ such that xj^ ^ xj^ and for all xi G we have that 

a(x - )-a(x ) and ^^-g^'-'^^^''^-^') - ^^iexj^i^i^^ji) 

a[x,j-a[xj,)and ^^^^^^^^^ - ^^^^^^^^^ , (3) 

then 

1. If the distribution y is in accordance with tiq, i.e. TToLaT/^ = 7> then for any finite sequence of 
states (xo, . . . iXk) G ^^^^ and any sequence of time intervals (/i , . . . ,4) G IM*^, we consider 
the set Tm of the traces of the form Xq Xj . . -x^ ^ k,t\+^-+tk ^.^ <i <k and 
Xi ~ X-, and for all i, I <i <k, tt G /( and U G we have that: ifK^fm.) > then yfj^ = y. 
In other words, knowing that we are in state x/, the conditional probability of being in state 
X G X/ is invariant of time. 

2. The process {Xf} is weakly lumpable with respect to ~. Moreover, we have: 



R{x[,xj) 



T,{Rixi,xj^) I Xi G x/}^ 



One shall notice that Thm. |2] gives a weaker condition than the completeness of WLTS abstraction 



(eg see Dfn. 19 1. The main reason is that we do not 'track' transition labels, in the sense that we observe 
the abstraction on the cylinder sets of traces induced only by ~, and not also by ~l- Yet, in the particular 
case when states fully define the transition labels (ie, if w{xi,li,x\) > 0, w{x2, 12,^2) > 0, xi ~ X2, and 
Xj ~ Xj, then /i h), the given condition for weak lumpability coincides with the definition of the 
complete abstraction of WLTS. 

The characterization of weak (resp. strong) lumpability given in Thm. [2] is sufficient, but not a nec- 
essary condition: there exist systems which are strongly or weakly lumpable, but do not satisfy the 
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conditions given in the theorem. Interestingly, there are systems, such that the characterization from 
Thm. |2] would detect as strong, but not weakly lumpable, which is counter-intuitive with the terminol- 
ogy. One shall also notice that the conditions of Thm. [2] imply that: in order to aggregate two states in 
the CTMC, they must not have different waiting times until the next transition (e.g. they should have the 
same activity). It is stated explicitly in the characterization of weak lumpability and it can be obtained 
by summation over the outgoing class in the characterization of strong lumpability. 

We consider a WLTS W = (^, , w, ttq), and the set of all equivalence relations ~ on denoted 
PT^ . We introduce the subsets of PT^, denoted PS, PW, CS, CW in the following meaning: (i) PS -the 
set of all equivalence relations such that {X,} is strongly lumpable with respect to ~; (ii) PW - the set 
of all equivalence relations such that {X,} is weakly lumpable with respect to ~; (iii) CS - the set of all 
equivalence relations such that {Xt} satisfies the condition for strong lumpability given in the Thm. [2| 
(iv) CW - the set of all equivalence relations such that {Xt} satisfies the condition for weak lumpability 
given in the Thm. |2] 

Lemma 1. (Relations on lumpability properties and conditions) Consider an arbitrary WLTS W = 
,^,w, TTo) and the equivalence relation ~C ^ x We have the following relations: (la) if^G PS 
then ~G PW, if ^G CS then ~G PS; and if ^£ CW then ~G PW; The converse implication does not 
hold for any of the statements; (2a) If ^G CW, that does not imply ~E CS; (2b) If ^G CS, that does not 
imply ~G CW. 



Proof. The statement in the part (1) trivially follows from the Dfn. 21 and Thm.|2] To show (2a) and (2b), 



we consider the WLTS W specified in the Fig. 1(a) with the state space ^ = {x,yl,y2,y^,Z\,Z2,z■i}. 
Let ~i be an equivalence relation on , such that y^ ~i 3^2 and zi ~i Z2- By lumping the states by ~i, 



we get the system W\, as shown in Fig. |l(b)| It is easy to check that ~iG CS. Moreover, we have that 
~iG CW , since for 

' X y\2 B zn Z3 
1 (0.5,0.5) 1 (0.5,0.5) 1 

the weak lumpability condition is satisfied, so ~ie CSHCW. We further lump the states yi2 and 3^3, by 



taking the transitive closure of the relation ~i union (y^ja), denoted ~2= fc(~i U(ji,>'3)) (Fig. 1(c) 1 
This lumping is such that ~2^ CS because we have 



On the other hand, for 



3^1 but w(ji,/,zi2) > 0, and w(j3,/,zi2) = 0. 



X ym z\2 Z3 

1 (1/3,1/3,1/3) (0.5,0.5) 1 



we argue that ~2£ CW- Therefore, if the initial distribution is in accordance with 7, the abstraction #2 
is sound and complete. 



If we rather lump zi andz2,by ~3=fc(~i U(zi,Z3)),we get the system #3 (Fig. 1(d) I. This system 
is such that CS\CW . More precisely, we cannot find a 7 which would witness ~3G CW: if such 
a 7 existed, we would have 7({x})(.x) = 1, and consequently 7(^12) = (0.5,0.5), and 7(3^3) = 1. This 
implies that the conditional distribution 7(zi23) cannot be invariant of time - it will alternate between the 
distributions (0,0, 1) and (0.5,0.5,0), depending on the choice made in x. Note that, since ~3G CS, it 
follows that ~3G PS, and this implies ~3G PW . □ 

This discussion indicates that if we decide to check for weak lumpability instead of for strong by us- 
ing the characterization from Thm.|2} it might happen that we eliminate the aggregations that are strongly 
lumpable. In the case of reductions of Kappa systems, we will use the weak lumpability characterization. 
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4.2 Bisimulations 

Aiming to define tlie algoritlim that is abstracting the WLTS of a Kappa system, we start by redefining 
the lumpability properties in the bisimulation notions. Bisimulation is typically defined by logically 
characterizing the distinguishing property of the states that may be aggregated. 

We define three kinds of bisimulation relations on the WLTS, which are based on the lumpability 
characterizations given in Thm. |2] We adopt the terminology of [i5J. The forward bisimulations arise 
from the characterization for strong lumpability: the bisimilar states have the same forward behaviour in 
the sense that they are each targeting any other lumped state with the same total affinity (total outgoing 
rate). This concept is well established for dependability or performance analysis |[T6l[T5l . What we use in 
the abstractions of Kappa systems is backward bisimulation. The bisimilar states have the same backward 
behaviour in the sense that they are reached by the predecessors from one lumped state with the same 
probabilistic quantity, which becomes the rate in the abstract system. It is however less established and 
only applied in very few approaches for stochastic modelling f30]. The backward uniform bisimulation 
is an instance of a backward bisimulation with an additional constraint that only the equally-probable 
states may be aggregated. 

Definition 23. Given a WLTS W = ,^,w, TTq), and (~, ~l) a pair of equivalence relations respec- 
tively over X and we define a function 5f : ^ x -2/^^ x — )• W^: 

5f{xi,l,Xj) = ^{\w{xi,l,Xj)\ I / G / andxj £ xj}. 



Furthermore, given a family of probability distributions over the partitions y € F;?;^, we define the 

Y,{y{xi,Xi)-\w{xi,l,Xj)\ I l£l,Xi£Xi} 



quantity 5b : x ^/^^ x — ;> R^: 



dB{xi,l,Xj) 



r{xj,xj) 



Specifically, if we have that y is a uniform distribution over the equivalence classes, we can express 
the latter expression in terms of cardinalities of the equivalence classes: 

^ \x '\ ^ 

5BuiXi,l,Xj) = -r^Y,{\w{Xi,l,Xj)\ I I G l,Xi G Xi} . 
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Definition 24. (Forward and backward Markov bisimulation) Given a WLTS W = ,^,w,7io), and 
(~5 ~l) fl' pciif of equivalence relations respectively over and we say that (~, ~i) is a 

1. Forward Markov Bisimulation, if for all x, and xj, the following is satisfied: ~ xj, iff for all 
equivalence classes x G ^/^,l £ -^l^^^' have that a{xi) = a{xj) and 6p{xi,l,x) = 5f{xj,l,x). 
Remark. Note that this involves the bisimulation in the classical sense: if Xi has a successor in 
some class, xj has it as well, and they are related by appropriate labels ( and probabilities in this 
case). 

2. Backward Markov bisimulation, if for all and xj, there exists an y^z F ^, such that the follow- 
ing is satisfied: x,- ~ xj, iff for all equivalence classes x G Xj^, I € ^/^^. have that a(x;) = aixf) 
and 5b(x, /,x,) = 5b(x, Z,Xy). 

Theorem 3. (Forward Markov bisimulation implies sound abstraction) Let W = {^9^ ^w^iU)) be a 
WLTS. If {^t^l) induces a forward Markov bisimulation, then for any aggregates Xi, I, and xj, we can 
define 

w(Xi,/~Xy) = 5F{xi,l,Xj). 

The so defined abstraction 'W = ( J^^- ' -^1-^ ' ^' ^"-^ sound. We then say that 'W refines 'W by a forward 
Markov bisimulation written 'W (^ ,^^) 'W ■ 

Theorem 4. ( Backward Markov bisimulation implies sound and complete abstraction ) Given a WLTS 
W = ,w,71q), if induces a backward Markov bisimulation with conditional probabilities 

over the aggregates y £ r_^^, then for any aggregates Xi,l, andxj, we can define 

w{xi, I, Xj ) = 5b (x, ,l,Xj). (4) 

IfY{x) = 7lo\x, then the so defined abstraction W = (^^,^/^^ , w, TTq) is sound and complete. We then 
say that W refines W by a backward Markov bisimulation with conditional distributions y, 

written W ^B,(^.^^).y 'W . 

In particular, if we know that y is uniform, the equation (j^J) becomes w{xi,l,Xj) = 5Bu{xi,l,Xj), 
written also W 

4.3 Proving bisimulations 

The forward bisimulation relation for abstracting the transition systems with CTMC semantics has been 
established and used in applications (eg, [il6ul5 |). Moreover, computing the backward uniform bisimu- 
lation when 7 is uniform is defined 1511301. It is based on an alternative characterization of the backward 
uniform Markov bisimulation, which eases the analysis. 

Lemma 2. (Proving backward uniform Markov bisimulation) Let = (,^,.^, w, TTq) be a WLTS and 
be a pair of equivalence relations respectively over ^ and J£. For any state x' G and any 
pair of classes x,l £ x -S^/^^, let us define the set Pred(x, /,x') of transitions from a state in x to the 
state x' and with a label in I as follows: 

Pred(x,f,x') = {(x,/) G x x f | w(x,/,x') > 0}. 

Assume that: (1) TIq]^^^^ = Kq, and (2) for any x^i^y^j S ^ such that x- ~ Xy and any x E / E 

■^/r^L, there exists a bijective map ^ between Pred(x, /,x-) and Pred(x, /,Xy), such that for any {xi,li) E 
Pred(x,Z,x-), i/'0(xi,/,) = {xj,lj), then we have that w{xi,li,x[) = w{xj,lj,x'j). 

Then we have that W is the backward uniform bisimulation of the abstraction W = ( , ,w,7io), 
i.e. W ^Bu,{^,^,) 
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w ^ w 



5b,(~.~i,),7 ,r 



Figure 2: Proving backward refinement 



On the other hand, as soon as 7 over the aggregates is not uniform, we cannot observe the bijection 
between predecessors over the states. Proving that the given abstraction is a backward bisimulation 
cannot be established unless we have a right 'guess' of the distributions 7. Lem. [3] states how to avoid 
proving backward bisimulation by instead proving two uniform backward bisimulations. More precisely, 
if we want to prove the backward refinement between the systems W and W, it is enough to observe the 



system W, which is a backward uniform refinement of both of the systems W and W (Fig. 4.3 1. 



Lemma 3. (Proving backward Markov bisimulation) Consider a WLTS If = ,^,w,71q), and any 
aggregation relation such that W = {^/^,^^^^,w,7to). We assume that there exist a sys- 

tem W = ,w' ,71q), and the two pairs of equivalence relations (~2;~L2). such that 

^fif/,(^,.^^i) ^Bj/,(^2,~L2) ~i^~2 (in the sense that, for any x\,x'2 G x\ ~i x'2 =^x\ ~2 

X2), ~Li ^~L2. <^nd, for any x\,X2 such that x\ ~2 ^2, the number of states which are equivalent to 
x\ is equal to the number of states which are equivalent to Xj. Under this assumption, we have that 

:^B.(r^.r^L),Y where 7 is defined as 

|{x'e JT' I x'^i4}| , 

y{x,x) = . . . p- Jorany [xq\^i =x. 

\{X' e ^' I X' '^2Xq}\ 

This Lemma contains the key observation for the abstraction of Kappa systems, and for proving 
Thm{T] It thus completes the intention of the theoretical analysis in this paper. More precisely, we 



observe the WLTS W of a given Kappa system as defined in Dfn. 12 and its abstraction generated 



as proposed in the reduction procedure (Sec. [3} Dfn. 16 1. The main observation is that the system W 



is already an abstraction. More concretely, the states of W are multisets of species, and as such, they 
abstract the individual species. For example, a state that contains two agents of type A(5„) abstracts away 
the potential individual behavior of these two agents, for example Ai{su) and A2{su)- To show that the 
abstraction is sound and complete, we observe the system W, which is the individual-based semantics of 
a Kappa system, where each individual agent is uniquely identified. The backward uniform refinement is 
established between W and W by the modeling assumptions. We are left to prove the backward uniform 



refinement between W and W. This is done by inspections on the ACM's (Dfn. 14 1 



4.4 Example 

We consider the following Kappa system. We have the agent types £/ = {A,B}, the site names {s,t}, the 
signatures £, (A) = £, [B) = {s} and (A) = (B) = {?}, the alphabet of internal states I = {u,p}. The 
contact map is defined by such that ^ = {{A,s),{A,t),{B,s),{B,t)} and S" = {{{A,t),{B,t))} 

and the following rules: 

rl :A(5j ^A{sp)@ki,ki^ 

r2:B{s,^)^B{sp)@k2,k2- 

r3:A{t) , B{t) ^A{t^) , B{t^)@k^,k^^ 
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Moreover, using the minimal ACM for annotating the agents, as written in Dfn. [14} we get that has 
two equivalence classes {s} and {t}; and that «5 has two equivalence classes {s} and {t} as well. 



The fragments derived from an ACM (Dfn. 15 1 are the following: F\ = A(5„), F2=A [Sp) ,F3=A{t), 
F, =A{t') , B{t'), Fs = B{s^), Fe=B{sp), Fj = B{t). 

Let us pick a (finite) initial distribution ttq- Now we observe the WLTS W = {^,^,w, TTq) assigned 



to the Kappa system (introduced in Dfn. 12 1, and the state y which is the =-equivalence class of the 
mixture By defined as follows: 

A{sp/) ,B{sp/) ,A{su/) ,B{s,/) ,A{su/) ,B{s,/) 

The unique (up to =) non -equivalent mixtures is Eyj which is defined as follows: 

A{sp/) ,B{s,/) ,A{su/) ,B{sp/) ,A[su/) ,B{s,/) 

We denote y' = [£"/]=, y' = [£'y]=t- We compute however that the distribution among state y = [Ey]=i. 

We have: Y{y,y) = 1/3 and Y{y,y') = 2/3. Roughly speaking this comes from the fact that if we 
annotate fragments of type A and B iny with the identifiers 1, 2, 3 (there are 36 possible annotation), and 
if we assume that agents with the same identifiers are bound together. Then there are 12 annotations so 
that the phosphorilated A and B are bound together, and 24 where this is not the case. A more detailed 
analysis of this model is given in |[T2l . 



5 Case study 

In this section, we apply our framework to a case study. We have indeed refactored in Kappa the model 
of a crosstalk between the EGF rececptor and the Insulin receptor pathways which is described in ||6l. In 
this model, two kinds of receptors, EGF receptor (EGFR) and insulin receptor (IR) can recruit a protein 
called Sos, which can be phosphorylated, or not. Each kind of receptor has its own pathway, and these 
two pathways shared some common proteins. 

The contact map is given in Fig. |3] One can notice that some sites can be bound to several other 
sites, which denotes a competition (concurrency). Moreover, the site dofa. EGF receptor (EGFR) can be 
bound to the site d of another EGFR (since there is a loop in the CM). Moreover rules are given in Table 
[T] We do not give the rate for rules, but we assume no hypotheses on the rates (rates are parameters, 
some of them might be equal, or not). Moreover, each rule is reversible. 

We roughly explain how each pathway works, by focusing on the forward direction of rules. First, we 
describe how EGFR can recruit a transport molecule (Grb). EGFR can recruit a ligand (EGF) on site a 
(r01,r02), and two EGFRs can form a dimer (r03,r04,r05). We have used two rules to encode EGF/EGFR 
binding, in order to model the fact that the rate of association may depend on the fact whether EGFR is 
in a dimer, or not. The same way, the rate of dimer formation/dissociation may depend on the number of 
ligands that are bound to receptors. The site b of EGFR can be phosphorylated (r06,r07) at a rate which 
depends whether the receptor is in a dimer, or not. Then, EGFR can recruit an adapter molecule called 
She (r08). Then, EGFR can phosphorylize She (r09,rI0) (the rate depend on the fact whether the receptor 
is still in a dimer, or not). She can then recruit a transport modecule (Grb) (rll). Yet, each receptor has 
a shorter way to recruit a transport molecule. The site c of EGFR can be phosphorylated (rl2,rl3), and 
then recruit Grb directly (rl4,rl5). 

Then, we describe how an insulin receptor (IR) can recruit Grb. IR can recruit insulin molecules (Ins) 
on two sites a (rl6,rl7) and b (rl8,rl9) (the rate may depend on the fact whether an insulin molecule 
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Sos 




Figure 3: Contact map for the SGF/^/Insulin crosstalk. 



rOl: EGF{a) , EGFR{a,d) < — > EGF[a^) , EGFR[a\d) 

r02: EGF{a) , EGFR{a,d-) < — > EGF[a^) , EGFR{a\d-) 

r03: EGFR(a,d) , EGFR(a-,d) < — > EGFR(a,d^) , EGFR{a-d') 

r04: EGFR{a,d) , EGFR(a,d) < — > EGFR(a,d^) , EGFR{a,d^) 

r05: EGFR{a~,d) , EGFR{a~,d) < — > EGFR{cr,d^) , EGFR{a-,d^) 

r06: EGFR(h^,d) < — > EGFR(hp,d) 

r07: EGFR(h^,d') < — > EGFR{bp,d-) 

r08: EGFRlbp) , Shc{a) i — > EGFR{h],) , 5/ic(a') 

r09: EGFR[bl,d) , Shc{a\b^) < — > EGFR(b]„d) ,Shc{a\bp) 

rlO: EGFR[bl,d~) ,Shc{a\b^) i — > EGFR{b]„d~) ,Shc{a\bi,) 

rll: Grb{a) , Shc{bp) < — > Grb{a') , Shc{b],) 

rl2: EGFR(c^„d) < — > EGFR[cp4) 

rl3: EGFR{c^,d-)< — >EGFR[cp4~) 

rl4: EGFR[ci„d) , Grb{a) < — > EGFR(c]„d) , Grb(a^) 

rl5: EGFR[ci„d~) , Grb{a) i — > EGFR(c]„d~) , Grb[a^) 

rl6: IR(a,h) , Ins(a) i — > IR(a\b) , Ins{a') 

rl7: IR{a,b~) ,Ins(a) i — > IR{a\b-) ,Ins(a^) 

rl8: IR{a,b) , Ins(a) < — > IR{aM) , Ins(a^) 

rl9: IR{a-,b) , Ins(a) i — > IR{a-b') , Ins(a') 

Table 1 : Rule set for tl 



r20: IR{a,b,c„) i — > IR{a,b,Cp) 

r21: IR{a-b,cJ< — >IR{a-b,Cp) 

r22: IR(a,b-cJ < — > IR[a,b~,Cp) 

r23: IR{a-b~,c„) < — > IR{a-,b-,c p) 

r24: IR[cp) , Shc(a) i — > IR{c],) , Shc[a^) 

r25: IR{a-b~c^) ,Shc[a\b}j < — > IR{a',b'c^) ,Shc{a\bp) 

r26: IR{a,b,du) < — > IR[a,b,dp) 

rll: IR{a-b,d^) i — > IR(a-,b,dp) 

r28: IR{a,b~,du) < — > IR{a,b-,dp) 

i29: IR{a-b~du) < — > IR{a-,b-,dp) 

r30: IR{dp) ,lRS(a)i — > IR{dl) ,IRS(a^) 

r31: IR[a-,b-,d^) ,IRS{a\b}) i — > IR{a-,b-,d') ,IRS{a\bp) 

r32: Grb(a) , IRS{bp) i — >■ Grb{a^) , IRS{b'p) 

r33: Grb{b) , Sos(d^) i — > Grb{b') , Sos{dl) 

r34: Grb{b) , Sos{dp) < — > Grb(b^) , Sos{d],) 

r35: Grb{b^) ,Sos{dl) < — > Grb{b^) ,Sos{dl) 

r36: Sos(d^i) i — > Sos{dp) 

r37: Shc(b^,) < — > Shc{bp) 

r38: IRS{b^) < — > lRS{bp) 

; FGF/?/InsuIin crosstalk. 



has already been recruited). The site c of the IR can be phosphorylated (r20,r21,r22,r23) at a rate which 
depends on the number of recruited insulin molecules (in practice the rates of rules r21 and r22 are the 
same). Then, IR can recruit an adapter She (r24). Whenever IR is also bound to two insulin molecules, 
She can be phosphorylized (r25). She can then recruit Grb (rll). Yet, IR has an other way of recruiting 
a Grb. The site d of IR (r26,r27,r28,r29), and then recruit another adapter called IRS (r30) which can be 
activated when the insulin receptor is bound to two insulin molecules (r31). Then, IRS can recruit Grb 
(r32). 

Independently, Grb can recruit a protein Sos (r33,r34). And Sos can be activated (r35,r36) at the rate 
which may depend on the fact whether it is bound to a Grb, or not. Other rules describe the recuitment 
of Sos by Grb. And spontaneous (de)phosphorylation of She (r37) and IRS (r38). 

In this model, 2,768 different complexes may occur. This number is mainly due to the fact that each 
dimer made of two proteins EGFR has 4 sites (the sites b and c for each EGFR) to recruit a Grb, which 
induces a small combinatorial blow up. Scanning the set of rules, one can notice that no rule tests both the 
site a and b of some proteins Grb. Thus, the partition {{a}, {^}} can be used safely for the sites of Grb, 
in the annotated contact map. As a consequence, the number of fragments is only 609. Unfortunatly, 
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this is the only reduction that we can do (ie, the partition for the sites of any other kind of proteins, is the 
coarsest one). 

Last, one can notice that, given some additional hypotheses on the rate of some rule, that the sites a 
and b of IR have a symetric role in the system. We could consider this symetry to reduce the set of consid- 
ered fragments, by identifying two symetric fragments, such as IR{a\b) , Ins[a^^ and IR[a,b^) , Ins[a^). 

6 Conclusions 

Reducing the complexity of combinatorial reaction mixtures is an important milestone towards simula- 
tion and analysis of large-scale realistic models of cellular signal transduction. In this paper we study a 
scalable reduction method, that is applicable to any rule-based specification. The reduction is sound and 
moreover complete, i.e. the sample traces of individual molecular species can be reconstructed from the 
traces of aggregated species in the reduced model. We put this method into the general context of abstrac- 
tions of probabilistic transition system and show that it yields a sufficient condition for weak lumpability 
and that it is equivalent to backward Markov bisimulation. The reduction factor depends on the number 
of independent molecular events and is strictly smaller than that of the less-demanding reduction based 
on the differential semantics. 

A compelling problem for future work is thus to analyze differential fragments in the context of 
stochastic semantics and to obtain error bounds for this reduction as a function of the kinetic parameters 
of the system. 
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